Adaptive separation of acoustic sources for anechoic conditions: A constrained frequency domain approach
Authors
Abstract
Blind source separation is a signal processing technique with large potential for noise reduction. However, its application in modern digital hearing aids places high demands on computational efficiency and on the speed of adaptation towards the desired solution. In this paper, an algorithm is presented which meets these demands under the idealized assumption that the superposition of sources in rooms can be approximated as a superposition under anechoic conditions. Specifically, attenuation, the signals’ finite propagation speed, and diffuse noise are accounted for, whereas reflections and reverberation are considered negligible. This approximation is referred to as the ‘free field’ assumption. Starting from a general blind source separation algorithm for Fourier transformed speech signals, the free field assumption is incorporated into the framework, yielding a simple, fast and adaptive algorithm that is able to track moving sources. Implementation details are given which were found to be indispensable for fast and robust signal separation. Performance is evaluated both by simulations and experimentally, including the separation of a moving and a fixed speaker recorded in a real anechoic environment. The potential benefits and shortcomings of this algorithm are discussed with regard to its inclusion in the signal processing framework of digital hearing aids for real reverberant acoustic situations.
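To make the free field assumption concrete, the following minimal sketch models each source-to-microphone path in a single frequency bin as an attenuation plus a pure propagation delay, with no reflections. It is an illustration of the mixing model only, not the adaptive algorithm of the paper: the geometry is assumed known here, whereas a blind method has to estimate it from the microphone signals. The 1/distance attenuation law, the speed of sound, the distances, the frequency and the source values are all assumptions chosen for the example, and diffuse noise is left out.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature


def free_field_gain(distance_m, freq_hz):
    """Transfer factor of one source-to-microphone path in the free field:
    assumed 1/distance attenuation times a pure propagation delay."""
    delay_s = distance_m / SPEED_OF_SOUND
    return cmath.exp(-2j * math.pi * freq_hz * delay_s) / distance_m


def mixing_matrix(distances, freq_hz):
    """2x2 mixing matrix A(f); distances[i][j] is microphone i to source j in metres."""
    return [[free_field_gain(d, freq_hz) for d in row] for row in distances]


def invert_2x2(m):
    """Inverse of a 2x2 complex matrix via the closed-form adjugate formula."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def apply(m, v):
    """Multiply a 2x2 matrix by a length-2 vector."""
    return [m[0][0] * v[0] + m[0][1] * v[1],
            m[1][0] * v[0] + m[1][1] * v[1]]


# Hypothetical geometry and made-up spectral values for one frequency bin.
distances = [[1.0, 1.5], [1.2, 1.1]]
freq_hz = 1000.0
sources = [1.0 + 0.5j, -0.3 + 0.8j]

A = mixing_matrix(distances, freq_hz)   # free-field mixing at this frequency
mics = apply(A, sources)                # what the two microphones observe
recovered = apply(invert_2x2(A), mics)  # demixing with the known geometry
```

Because each path is described by only two numbers (a gain and a delay), the demixing matrix at every frequency follows from a handful of parameters, which is what makes a constrained frequency domain approach attractive for fast adaptation.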
Similar resources
On the Window-disjoint-orthogonality of Speech Sources in Reverberant Humanoid Scenarios
Many speech source separation approaches are based on the assumption of orthogonality of speech sources in the time-frequency domain. The target speech source is demixed from the mixture by applying the ideal binary mask to the mixture. The time-frequency orthogonality of speech sources is investigated in detail only for anechoic and artificially mixed speech mixtures. This paper evaluates how ...
Fundamental Limitation of Frequency Domain Blind Source Separation for Convolved Mixture of Speech
Despite several recent proposals to achieve Blind Source Separation (BSS) for realistic acoustic signals, the separation performance is still insufficient. In particular, when the impulse response is long, the performance is highly limited. In this paper, we consider the reason for the poor performance of BSS in a long reverberation environment. First, we show that it is useless to b...
Overdetermined Blind Separation of Acoustic Signals Based on MISO-Constrained Frequency-Domain ICA
We propose a new overdetermined blind source separation (BSS) method using frequency-domain independent component analysis (FDICA) based on a multiple-input single-output (MISO) constraint. To achieve a superior separation performance under reverberant environments, we set the number of microphones to be larger than that of sources. This leads to alternative problems in which the sound qualities of the s...
A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement
A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...
Numerical simulation of anechoic coatings with cavities using a combination of the finite element and acoustic duct methods
The absorption performance of anechoic coatings depends on the material properties, layer thicknesses, cavity distribution density, and cavity size. In this paper, a design method based on numerical simulation is presented that combines FEM and the acoustic duct method (ADM). The anechoic coatings were analyzed for an active-sonar plane wave at normal incidence. In thi...
Journal title:
- Speech Communication
Volume 39, Issue
Pages -
Publication date 2003